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Abstract 

When estimating the risk of a P&L from historical data or Monte Carlo simulation, the 
robustness of the estimate is important. We argue here that Hampel's classical notion of 
qualitative robustness is not suitable for risk measurement and we propose and analyze a 
refined notion of robustness that applies to tail-dependent law-invariant convex risk measures 
on Orlicz space. This concept of robustness captures the tradeoff between robustness and 
sensitivity and can be quantified by an index of qualitative robustness. By means of this 
index, we can compare various risk measures, such as distortion risk measures, in regard to 
their degree of robustness. Our analysis also yields results that are of independent interest 
such as continuity properties and consistency of estimators for risk measures, or a Skoroliod 
representation theorem for -i/^-weak convergence. 
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1 Introduction 

Let X denote the P&L of a financial position. When assessing the risk of X in terms of a monetary 
risk measure p it is common to estimate p{X) by means of a Monte Carlo procedure or from a 
sequence of historical data. When p is a law-invariant risk measure, a natural estimate for p{X) 
is given by Tlp{rh), where m is the empirical distribution of the data and Tip is the functional 
defined by 

TZp{p) = p{X) if X has law p; 

see, e.g., [1, 2, 3, 9, 31]. In this context, it was pointed out by Cont et al. [9] that it is important 
to consider the robustness of the risk functional TZp. Informally, robustness refers here to a certain 
insensitivity of the estimate with respect to deviations of m from the 'true', theoretical distribution. 
It will especially yield a stable behavior of the estimates when the estimation process is repeated 
periodically. Such a stable behavior is particularly desirable when p{X) serves in allocating the 
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economic capital required from a large position X, since altering the capital allocation may be 
costly. 

On a mathematical level, Cont et al. [9] use Hampel's [19] classical concept of qualitative 
robustness, which, according to Hampel's theorem, is essentially equivalent to the weak continuity 
of Tip. Consequently, it was pointed out in [9] that no risk functional Tip that arises from a law- 
invariant coherent risk measure p can satisfy Hampel's requirement of qualitative robustness, not 
even if p is simply the ordinary expectation of the loss. The results in [9] therefore seem to weigh 
heavily in favor of Value at Risk, since Value at Risk does essentially satisfy Hampel's notion of 
robustness. 

Our goal in this article is to point out that in risk measurement the use of Hampel's classical 
concept of qualitative robustness may be problematic and to propose and analyze an alternative 
concept based on [22]. Let us start by discussing two major drawbacks of Hampel's robustness in 
risk measurement. 

First, two P&Ls may have laws that are close with respect to the weak topology but still 
have completely different tail behavior. Qualitative robustness of Tip therefore requires that p 
is essentially insensitive to the tail behavior of a P&L. In the recent years of financial crisis, it 
has become apparent, though, that an faulty assessment of tail behavior can lead to a dramatic 
underestimation of the corresponding risk. 

Second, Hampel's robustness concept creates a sharp division of the class of law-invariant 
monetary risk measures into those for which Tip is 'robust' and those for which Tip is 'not robust'. 
The first class contains risk measures such as Value at Risk that are insensitive with respect to tail 
behavior of P&Ls whereas the second class contains the ordinary expectation and all law-invariant 
coherent risk measures [9]. But, as we will see, the distinction between 'robust' and 'non-robust' 
risk measures is artificial because there is actually a full continuum of possible degrees of robustness 
beyond the classical concept. So labeling a risk measure as 'robust' or 'non-robust' may give a 
false impression. 

In this article, we will analyze the robustness properties of law-invariant convex risk measures 
based on the refined notion of qualitative robustness that was proposed in [22] and is further 
developed here in Section 2.4. Instead of a sharp division into 'robust' and 'non-robust' risk 
measures, this notion allows us to assign a degree of robustness to most risk measures and to 
compare different risk measures in regard to their degrees of robustness. We thereby capture 
the natural tradeoff between robustness and tail sensitivity in risk measurement. The degree 
of robustness can be expressed numerically by the index of qualitative robustness proposed in 
[22]. This index takes values in [0,oo], with the respective extremes -|-oo and corresponding 
to Hampel's robustness and to full tail sensitivity. Some of our main results will show that a 
greater index of qualitative robustness implies greater robustness in a sense that is mathematically 
precise. We will also show how our index can be computed for distortion risk measures such as 
MINMAXVAR or Average Value at Risk (which is also called Expected Shortfall, Conditional 
Value at Risk, or TailVaR). 

Since we are interested in the tail dependence of convex risk measures, it is not sufficient to 
consider only bounded P&Ls. We therefore build on the analysis of Cheridito and Li [6], who 
observed that Orlicz spaces or Orlicz hearts are appropriate domains for convex risk measures 
when P&Ls are unbounded. We are particularly interested in the continuity properties of p and 
its corresponding risk functional Tip, and we find that an important role is played by the so-called 
A2-condition of the underlying Orlicz space. 

Our analysis also yields some results that are of independent interest. For instance, we obtain 
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a Skorohod representation theorem that hnks tjj-weak convergence of probabihty measures to norm 
convergence of random variables in Orhcz space. 

Our article is organized as follows. In Section 2.2 we prove the consistency of the estimator 
TZp{m) for law-invariant convex risk measures and general stationary and ergodic data. In Section 
2.3 we analyze the continuity properties of TZp and show that TZp basically inherits the continuity 
of the original risk measures if and only if the Orlicz space supporting p satisfies the A2-condition. 
In Section 2.4 we present our main results on the comparative and qualitative robustness of law- 
invariant convex risk measures p. In particular, we introduce our refined notion of robustness and 
the index of qualitative robustness, and we show that these notions are well-defined whenever the 
Orlicz space supporting p satisfies the A2-condition. In Section 2.5 we show that our results can 
be easily applied to distortion risk measures. 

In Section 3.1 we continue and strengthen the robustness analysis for general statistical func- 
tionals started in [22]. In particular, we state stronger versions of our Hampel-type theorem and 
its converse than those given in [22]. In Section 3.2 we state and prove our above-mentioned 
Skorohod representation result. Most other proofs can be found in Section 4 and the appendix. 

2 Statement of main results 
2.1 Setup 

Let (fi, J-", P) be an atomless probability space and denote by L° := J-", P) the usual class of 

all finitely- valued random variable modulo the equivalence relation of P-a.s. identity. Let X d 
be a vector space containing the constants. An element X oi X will be interpreted as the P&L 
of a financial position. We will say that a map p : A" — )■ M is a convex risk measure when the 
following conditions are satisfied: 

(i) monotonicity: p{X) > p{Y) for X,Y E X with X <Y; 

(ii) convexity: p{XX + (1 - X)Y) < Xp{X) + (1 - A)p(F) for all X,Y e X and X E [0, 1]; 

(iii) cash coercivity: p{—m) — y +oo when m G M tends to +oo. 

In the standard definition of a convex risk measure, cash coercivity is replaced by the stronger 
requirement of 

(iii') cash additivity: p{X + m) = p{X) — m for X E X and m G M. 

But it was argued in [15] that the requirement of cash additivity should be relaxed when interest 
rates are stochastic or ambiguous, or when bonds are subject to possible default. We will therefore 
not insist on cash additivity and work with the weaker axiom (iii) instead. Our examples, however, 
will be formulated within the framework of cash-additive convex or coherent risk measures. 

As discussed, e.g., in [1, 2, 3, 9] it is a common procedure to estimate the risk of a financial 
position by means of a Monte Carlo procedure or from a sequence of historical data. Such a 
procedure makes sense when p is law-invariant: p{X) = p{X) whenever X and X have the same 
law under P. Let us denote by M.{X) := {P o X~^ : X G X} the class of all Borel probability 
measures on M that arise as the distribution of some X E X. Law invariance of a risk measure 
p : — J- M is equivalent to the existence of a map TZp : Ai{X) — )■ M such that 




p(x) = 7^,(Pox-l) 



X ex. 
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This map TZp will be called the risk functional associated with p. It can be used in a natural way 
to construct estimates for the risk p{X) oi X E X. All one has to do is to take an estimate /!„ 
for the law /i = P o X"^ of X and to plug this estimate into the functional TZp to get the desired 
estimator: 

(2.2) pn := TZpifin); 

see, e.g., [1, 2, 3, 9, 31]. For instance, /!„ can be the empirical distribution ^ ^^=1 S^,, of a sequence 
Xi, . . . ,Xn of historical observations or Monte Carlo simulations. 

Example 2.1. In this very basic example we take X = L} and p{X) = — E[X]. Then p is a 
law-invariant coherent risk measure and TZpip) is simply the negative mean of the measure p, i.e., 
TZpip) = — / xjii^dx). 

There are two natural questions that arise in this context. The first question refers to the 
consistency of a sequence of estimates p„. That is, under which conditions do we have p„ — )■ p{X) 
as n t c>o? When we assume that the estimates pn converge to p in some suitable topology on the 
space of measures, then the consistency of pn boils down to establishing the continuity of IZp in 
that topology. We will thus also analyze the continuity properties of IZp. 

Once consistency and continuity have been established, one can investigate the robustness 
of the estimate p„. Informally robustness refers to the stability of p„ with respect to small 
perturbations of the law under which the data points generated. The issue of 

robustness of this plug-in method for risk measures was first raised in [9] . Here we will address it 
in Section 2.4. 

Before stating our results, we need to specify the setting in which we are going to work. A 
common choice for X is the space L°° := L°°(f2, J-", P) of all bounded random variables. When 
dealing with possibly unbounded risks, however, the choice X = L°° is not suitable. It was 
observed in [4, 6] that Orlicz spaces or Orlicz hearts may be appropriate choices for X when 
risks may be unbounded. Let us thus recall the basic notions of Orlicz spaces. Following [6], a 
Young function will be a left-continuous, nondecreasing convex function ^ : M+ — )■ [0, oo] such 
that = ^E'(O) = lim^.|o '^{x) and lima,-|-oo '^{x) = oo. Such a function is continuous except possibly 
at a single point at which it jumps to +oo. The Orlicz space associated with is 

:= L'^(fi,J^,P) = {X G L° : E[*(c|X|)] < oo for some c > O}. 

It is a Banach space when endowed with the Luxemburg norm, 

\\Y\\^ := inf {A > : E[*(|F|/A)] < 1}. 

The Orlicz heart is defined as 

ff* := iy*(l],J^,P) = {X G L° : E[^(c|X|)] < oo for all c> O}. 

When takes the value -|-oo, then iJ"^ = {0} and = L°°. For this reason, we will mainly 
focus on the case in which \l/ is finite. Then C C L"^ C L^, and these inclusions may all 
be strict. In fact, the identity = holds if and only if \l/ satisfies the so-called A2-condition, 

(2.3) there are C, Xq > such that ^'(2x) < C^(x) for all x > Xq; 

see [14, Theorem 2.1.17 (b)]. This condition is clearly satisfied when specifically \E'(x) = x^/p for 
some p G [1, oo). In this case, H'^ = L* = and \\Y\\<s, = p~^^'P\\Y\\p. 
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Example 2.2 (Risk measure based on one-sided moments). The risk measure based on one-sided 
moments is defined as 

(2.4) p{X) := -E[X] + aE[((X - ¥.[X])-YY'p, 

where p G [l,oo) and a G [0, 1] are constants; see also [12]. It is well-defined and finite on L^, 
law- invariant, and it is easily seen that it satisfies the axioms of a convex risk measure. <C> 

When the A2-condition (2.3) is not satisfied, then the Orlicz heart rather than the Orlicz space 
L* is the natural domain for a convex risk measure as is illustrated by the following examples. 

Example 2.3 (Entropic risk measure). The entropic risk measure is defined as 

(2.5) p(X):=ilogE[e-^^], 

where /3 is a positive constant; see [17, Example 12]. It is well-defined and finite on the Orlicz 
heart for the Young function \E^(x) = e^ — 1, but it is clearly not finite on the entire Orlicz space 
. The associated risk functional TZp : A^(if*) — >■ M is given by TZpif^) = ilog / e~^^ fx{dx). <) 



Example 2.4 (Utility-based shortfall risk). The utility-based shortfall risk measure with loss 
function i was introduced in [17] as 

(2.6) p{X) := inf{m G R : E[i{-X - m) ] < Xo} 

for X G L°°, where £ : M — M is nondecreasing, not identically constant, and Xq belongs to the 
interior of ^(M); see also Section 4.9 in [18]. By taking i{x) = e^^ we recover the entropic risk 
measure (2.5). In the general case, we can define a finite Young function \E'(x) := l{x) — £(0) for 
a; > 0. With this choice, p{X) is well-defined and finite for each X G . Indeed, we have 

(2.7) < i{-X -m)< ^£(-2X) + i^(-2m) < ^^(2|X|) + U>{2\m\) + £(0), 

which implies that E[£(— X — m) ] is finite for m G M and X G . It is now easy to see that p is 
in fact a convex risk measure on . But when \1/ does not satisfy the A2-condition (2.3), then 
p{X) need not be finite for each X G . "0 

2.2 Consistency 

Let p be a law-invariant convex risk measure on if*, where is a finite Young function. We 
start by discussing the consistency of estimating the risk p{X) from a stationary and ergodic 
sequence Xi,X2,... in if* (see [5, Section 6.7] for the definition of a stationary and ergodic 
process). This is a natural question if one wishes to estimate p{X) from historical data or from 
Monte Carlo simulations, where X is a random variable with the same law as Xj. Recall that 
every i.i.d. sequence is stationary and ergodic, and that ergodicity is implied by various mixing 
conditions. 
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We denote by 



(2.8) 



m. 



•n 




i=l 



the empirical distribution of Xi, . . . , X„ and by 
(2.9) Pn := ■ 



TZpifhn) 



the corresponding estimate for p{X). 

Theorem 2.5. Suppose that p is a law-invariant convex risk measure on and Xi,X2, ... is 
a stationary and ergodic sequence of random variables with the same law as X E . Then (2.9) 
is a consistent estimator for p{X) in the sense that p„ — )■ p{X) P-a.s. 

In Theorem 2.5 it is essential that the sequence Xi,X2,... satisfies a strong law of large 
numbers such as Birkhoff's ergodic theorem or Kolmogorov's law of large numbers. Perhaps 
surprisingly, it will in general not suffice to take just any reasonable estimating sequence (Jin) for 
PoX~^ to obtain the consistency 7lp{pn) p{X). This is due to the possible failure of continuity 
of the map p i— )■ TZp{p) when \1/ is not chosen suitably. We will give a precise meaning to this 
in our Theorem 2.7, where we analyze the continuity properties of the map p i— )■ TZp{p). These 
continuity properties will also be crucial for our subsequent discussion of the robustness of the 
estimators (2.9). 

2.3 Continuity properties of 'K.p 

The basic issue when discussing the continuity of TZp can already be observed in Example 2.1. 
There the map TZp{p) = — J x p{dx) is not continuous with respect to the standard weak topology 
of measures. We therefore need to use a stronger topology, a fact that was already observed in 
[31]. More precisely, we will consider the ip-weak topology associated with a weight function ip, 
i.e., a continuous function t/; : M — )■ [0, oo) satisfying ip > 1 outside some compact set. We denote 
by Mf := Mt(R) the class of all probability measures /i on M for which J ip dp < oo. It coincides 
with the set A^i := A^i(M) of all probability measures on M if and only if is bounded. 

Furthermore, C^(M) will denote the space of all continuous functions / on M for which 
s^PxeK l/(^)/(l + '^(^))l < The ip-weak topology on Aif is the coarsest topology for which 
all mappings /it—)- J f dp with / G C^(M) are continuous; cf. Section A. 6 in [18]. Clearly, the 
■^-weak topology is finer than the weak topology, and the two topologies coincide if and only if ip 
is bounded; see Appendix A. 2 for details. When \& is a Young function, then \1/(| ■ |) is a weight 
function, and we will simply write Aif in place of A^f^' '^ We will also use the term \l/-weak 
convergence instead of \1/(| ■ |)-weak convergence etc. We recall the notation 



for the class of all laws of random variables X G H . 

Remark 2.6. For any Young function the identity Ai{H^) = Aif holds if and only if \1/ 
satisfies the A2-condition (2.3). Indeed, since the underlying probability space is atomless, A^f 
coincides with the set of the laws of all random variables X with E[^'(|X|)] < oo. But by [14, 
Theorem 2.1.17] this class of random variables coincides with i/* if and only if the A2-condition 
holds. 



M{H^) = {FoX^' : X e H"^} 
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Theorem 2.7. For a finite Young function the following conditions are equivalent. 

(a) For every law-invariant convex risk measure p on , the map IZp : M.{^H^) — )■ M zs 
continuous for the '^-weak topology. 

(b) \1/ satisfies the A2-condition (2.3). 

Remark 2.8. Fixp G [1, oo) and let "^pix) = x^/p. According to [29, Theorem 7.12], the \[^p-weak 
topology is generated by the Wasserstein metric of order p, 

(ii4/p(/U, z/) := I J \^ ^ y\^ "^{dx, dy)^ : tt G A^i(M x M) with marginals z/|. 

Since \E'p satisfies the A2-condition (2.3), Theorem 2.7 implies that TZp is continuous with respect 
to dwp whenever p is a law-invariant convex risk measure on L^. A corresponding result for p = oo 
is stated in [31, Lemma 2.4]. O 

In Theorem 2.7, a risk measure p is given on some Orlicz heart if*, and it is shown that TZp is 
continuous with respect to the ^'-weak topology. But one could ask whether Tip is even continuous 
with respect to a weaker topology. For instance, this would be the case when p can be extended 
to a law- invariant convex risk measure on a larger Orlicz heart iJ* D iJ*. 

To address this question, we consider the generic situation in which p is a law-invariant convex 
risk measure on and let 



denote the unique extension of p that is convex, monotone, and lower semicontinuous with respect 
to the L^-norm. The existence of such an extension was proved in [16]. When p is finite on some 
Orlicz heart , it will be continuous on if* with respect to the corresponding Luxemburg norm 
by [6, Theorem 4.1], and so it will also be cash additive on if* when p is cash additive on L°°. 

Theorem 2.9. Suppose that p is a law-invariant convex risk measure on L°° . Let furthermore be 
a Young function satisfying the A2-condition (2.3). Then the following conditions are equivalent. 

(a) p is finite on if*. 

(b) The map TZp : A^(ii*) — )■ M zs continuous for the '^-weak topology. 

(c) The map TZp : Ai{L°°) M. is continuous for the '^-weak topology. 

(d) If (Xn) is a sequence in L°° with ||^ri||* — > 0, then p{Xn) — )■ p(0). 



(2.10) 




1 



7 



2.4 Qualitative and comparative robustness 

Informally, Hampel's classical concept of qualitative robustness of an estimator requires that a 
small change in the law of the data results in only small changes in the law of the estimator. For 
a precise statement, it will be convenient to assume that the data arises from an i.i.d. sequence 
of random variables (Xi). We can then assume without loss of generality that the underlying 
probability space has a product structure: Q = M^, Xi{uj) = u){i) for u E Q and i G N, and 
J-" := cr(Xi, X2, . . .). For any Borel probability measure fi on M, we will denote by 



the corresponding product measure under which the Xj are i.i.d. with common distribution /x. 
Then J-", P^) is atomless unless yU. is a Dirac measure. We will retain the definitions (2.8) and 
(2.9) for the empirical distribution m„ and for the corresponding risk estimate p„ in this setting. 
We can now state the following version of Hampel's definition of qualitative robustness, suitably 
adapted to our more general setting. 

Definition 2.10 (Qualitative robustness). Let A/" C A4i be a set of probability measures. Let 
furthermore (Ia be a metric on A/" and ds he a. metric on A^i. Then the the risk functional TZp is 
called robust on M with respect to dA and ds if for all G A/" and e > there exists 5 > and 
riQ G N such that 

(2.11) ueM, dA{fi,iy)<S ^ rfB(P,.op„-i,P,op„-i) <£ forn > no. 

In the classical literature on qualitative robustness [19, 20, 11, 24] and in [9], the distances dA 
and ds are chosen so that they generate the weak topology of measures. For instance, they are 
taken as the Prohorov or Levy metrics, and A/" is taken as A^i. It is a consequence of Hampel's 
celebrated theorem that such a choice essentially limits the concept of robustness to risk functionals 
that are continuous for the weak topology; see, e.g., [20, Theorem 2.21]. 

Insisting on metrics for the weak topology when assessing the robustness of a risk functional 
TZp is problematic for a number of reasons. 

First, two distributions n and u can be rather close with respect to a distance dA for the weak 
topology, but still have completely different tail behavior. In this case, robustness (2.11) requires 
that TZpifi) is insensitive to the tail behavior of /i. This can be seen immediately, e.g., from the 
definition of the Levy metric, 

c?Levy(/i, i^) = iuf > : Fp{x — e) — e < Fj,{x) < Fp{x + e) + e for all x}, 

where Fp and Fj, are the distribution functions for /i and u. In the recent years of financial crisis, 
it has become apparent, though, that a misspecification of the tail behavior of a P&L can lead to 
a dramatic underestimation of the associated risk. 

Second, linking the terminology of robustness exclusively to the weak topology generates a 
sharp but artificial division of risk functionals into the class of those that are "robust" and another 
class of those that are "not robust" . The first class contains risk functionals such as Value at Risk 
which are insensitive to the tail behavior of P&Ls, whereas the second class contains, e.g., the 
ordinary expectation and all law- invariant coherent risk measures [9]. This classification thus 
seems to provide a strong argument in favor of Value at Risk and against coherent risk measures. 
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We will see, however, that the distinction between "robust" and "non-robust" risk functionals is 
artificial because there is actually a full continuum of possible degrees of robustness beyond the 
classical concept. This new look at robustness will then help us to bring the argument against 
coherent risk measures back into perspective: robustness is not lost entirely but only to some 
degree when Value at Risk is replaced by a coherent risk measure such as Average Value at Risk. 

It was observed in [22] that the basic idea for addressing this problem is to choose suitable 
metrics in (2.11). For ds we will take the Prohorov distance, which is defined as 

rfproh(yU, 1^) ■■= inf {e>0 : fi{A) < u{A') + e for all A e B{R)}. 

Note that (iproh > c^Levy, SO our choice ds '■= (iproh gives a stronger notion of robustness than the 
choice ds '■= (iLevy, which would just as well be possible. For dA, we will choose the Prohorov 
ip-metric, 

which by Lemma A. 3 metrizes the ip-weak topology on Ai^ for a given weight function tp. Also in 

(2.12) we could have replaced ciproh by (iLcvy, but here the advantage of dp^-oh is that it extends to 
a multivariate setting; see [22]. When iIj{x) = ^(|x|) for some Young function we will simply 
write d\fj instead of c?<i'(| |)- We will also need the following notion. 

Definition 2.11. Let he a. weight function. A set A/" C Aif is called uniformly ip -integrating 
when 

(2.13) lim sup / ijjdu = 0. 

When ijj is bounded, every set A/" C A^f = A^i is uniformly ?/^- integrating. But when ijj is not 
bounded, then the uniform continuity in (2.11) will typically hold only on uniformly integrating 
sets A^. Let us therefore introduce the following concept of robustness. 

Definition 2.12. Let he a, weight function and Ai C A/lf. A risk functional TZp is called 
■ip-robust on Ai when TZp is robust with respect to d^ and (iproh on every uniformly '0-integrating 
set J\f C M. 

We can now state the following preliminary result. 

Proposition 2.13. Let TZp be the risk functional associated with a law-invariant convex risk 
measure p on L°° . When ip : M+ — (0, oo) is a nondecreasing function such that TZp is ■ |)- 
robust on M.{L°°), then ip has at least linear growth: lim inf ^-|-oo (2^) /^^ > 0. 

The significance of the preceding proposition is that it allows us to essentially limit the analysis 
of the ■j/'-robustness of risk functionals to weight functions ip{x) = ^I/(|a;|) arising from a Young 
function In this context, we have the following result. 

Theorem 2.14. For a finite Young function the following conditions are equivalent. 



(2.12) 



d^{fi, v) := dprohip, 1^) + 



ip dp — I ip dv 
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(a) For every law-invariant convex risk measure p on , TZp is '^-robust on A4.{H^). 

(b) \I/ satisfies the A2-condition (2.3). 

As in Theorem 2.9, H"^ may not be the "canonical" space for p in the sense that p can be 
extended to a larger space. Such a situation has an impact on the robustness of p as explained in 
the next result. By p we denote again the extension (2.10). 

Theorem 2.15. Let be a Young function satisfying the A2-condition (2.3). For a law-invariant 
convex risk measure p on L°° , the following conditions are equivalent. 

(a) TZ-p is '^-robust on A^f . 

(b) Tip IS '^-robust on M{L'^). 

(c) p is finite on . 

The most important aspect of Theorem 2.15 is that it allows us to study the robustness 
properties of a given risk functional on M.{L°°) rather than on its full domain. Since any risk 
functional that arises from a law-invariant convex risk measure is defined on M.{L°°), we can thus 
compare two risk functionals in regard to their degree of robustness. 

Definition 2.16 (Comparative robustness). Suppose that pi and p2 are two law-invariant convex 
risk measures on L°°. We will say that pi is at least as robust as p2 if the following implication 
holds. When is a Young function satisfying the A2-condition (2.3), and TZp2 is ^'-robust on 
M{L°°), then TZp-^ is \&-robust on M{L°°). When, in addition, there is a such that TZp^ is 
^f-robust on A4(L°°) but TZp^ is not, then we will say that pi is more robust than p2- 

We immediately get the following corollary. 

Corollary 2.17. For two law-invariant convex risk measures pi and p2 on L°° , the following 
conditions are equivalent. 

(a) pi is at least as robust as p2. 

(b) When the Young function \1/ satisfies the A2-condition (2.3) andj)2 is finite on , thenj)-^ 
is also finite on . 

Instead of comparing the robustness of two risk measures with respect to all possible weight 
functions ip, it makes sense to restrict the attention to the weight functions ipp{x) = \x\^/p for 
< p < oo. This leads to the following notion that was first proposed in [22] within a more 
general context. 

Definition 2.18 (Index of qualitative robustness). Let p be a law-invariant convex risk measure 
on L°°. The associated index of qualitative robustness is defined as 



iqr(p) = ( inf [p G (0, oo) : TZp is ?/;p-robust on M{L°°) 




10 



It follows from Proposition 2.13 that any law-invariant convex risk measure p satisfies iqr(p) < 
1. Thus, Theorem 2.15 implies that 

(2.14) iqr(p) = (^inf {p G [1, oo) : p is finite on L^} j 

Our next result shows that the index of qualitative robustness is a convenient tool for comparing 
the qualitative robustness of two law-invariant convex risk measures. 

Theorem 2.19. Suppose that pi, p2 are two law-invariant convex risk measures such that iqr(pi) > 
iqr(p2)- Then pi is more robust than p2- 

Example 2.20 (Risk measure based on one-sided moments). For the risk measure p defined in 
(2.4) with p G [1, oo) and a > we obviously have iqr(p) = 1/p. <C> 

2.5 Index of qualitative robustness for distortion risk measures 

We now turn to the important example class of distortion risk measures defined as 

pO pod 

(2.15) p,(X) := / giFxiy)) dy - (l - giFx{y)) dy, 



where Fx denotes the distribution function of X, and g is a. nondecreasing function such that 
g{0) = and g{l) = 1; see e.g. [32, 13, 30, 23, 18]. It is a consequence of a theorem by Dellacherie 
and Schmeidler that Pg is a law-invariant convex risk measure on L°° if and only if g is concave 
(see Proposition 4.75 and Theorem 4.94 in [18] for a proof). In this case, pg is even coherent and 
can be represented as 

(2.16) pg(X) = ^(0+)esssup(-X) + /" V@Rt{X)g'^{t) dt, X G 

Jo 

where g'_^ is the right-hand derivative of g and V@Rt(X) := — inf{y : Fx{y) > t} denotes the 
Value at Risk at level t; see, e.g., [18, Theorem 4.70]. It was pointed out in [9] that in this case p 
cannot be qualitatively l-robust. On the other hand, the risk functionals of two different concave 
distortion risk measures may have a rather different behavior in regard to qualitative robustness. 
This is the content of the following result. 

Proposition 2.21. Suppose that g is concave and continuous. Then, forp G [1, oo) and ^ + ^ = 1, 
the extension Pg of Pg is finite on if and only if g'_^ G ^^'^(0, 1). In particular, 

iqr(pg) = - — - — where q* = sup |g > 1 / {g'_^_{t))'' dt < oo|. 

q* ^ Jo ^ 

Example 2.22 (Average Value at Risk). The risk measure Average Value at Risk at level a G 
(0,1), AV@Rq,, is given in terms of the concave distortion function gi{t) = (t/a) A 1 (see [18, 
Example 4.71]). AV@R is also called Expected Shortfall, Conditional Value at Risk, or TailVaR. 
Since g[ is bounded, it follows from Proposition 2.21 that iqr(AV@RQ) = 1. 

More generally, we can consider the distortion function gpit) = (t/a)^ A 1 for some (3 G (0, 1]. 
It is easy to see that the corresponding risk measure Pg^ has iqr(pg^) = /3. It follows already from 
this simple example that distortion risk measures cover the whole possible range of our index of 
qualitative robustness. <0> 
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Example 2.23 (MINMAXVAR). In [7, 8] the concave distortion risk measures MINVAR, MAX- 
VAR, MINMAXVAR, and MAXMINVAR were introduced. They play an important role in conic 
finance [8]. For instance, MINMAXVAR is defined in terms of the concave distortion function 

g,^^(t) = 1 - (1 - tThy^\ 

where A and 7 are nonnegative parameters. An easy computation shows that g'\^{t) ~ c ■ 

as t i 0, and so we have iqr(MINMAXVAR) = j^. ' 

3 Some general results 

In this section, we will state and prove some theorems that are crucial ingredients for the proofs 
of the results from Section 2. These theorems and propositions are interesting in their own right 
and stated in the general contexts of robust statistics and ip-weak convergence. 

3.1 Hampel-type theorems for the ?/?-weak topology 

Hampel [19] introduced the notion of qualitative robustness with the weak topology in mind. In 
Section 2.4 we have argued that it is necessary to replace the weak topology with a finer ip-weak 
topology so as to obtain a more balanced picture of the robustness of a risk functional. Such an 
approach was first suggested in [22]. In this section, we will give versions of Hampel's theorem 
and its converse for the ip-weak topology that are slightly stronger than the corresponding results 
in [22]. We need them as basis for the results in Section 2.4, but we will state them here in the 
framework of robust statistics rather than in the narrower context of risk functionals. For the 
sake of consistency with the preceding sections, we have chosen a one-dimensional setting, but we 
could just as well have stated our results in the even more general multivariate framework of [22]. 

As in Section 2.4, we consider the canonical product space Q := with Borel field J-" and 
coordinate mappings which become i.i.d. random variables under a product measure = 

fi^^. A statistical functional will be a map T : — )■ M, where Ai G Aii must contain all 
measures of the form ^ Ylk=i for G N and Xi, . . . , x„ G M. It gives rise to a sequence of 
estimators given by := Tijhn), where m„ is the empirical distribution of Xi, . . . , X„ as in (2.8). 
Clearly, the risk functional TZp associated with a law-invariant risk measure p is an example of a 
statistical functional. Also, recall from Definition 2.11 the notion of a uniformly ^/^-integrating set. 
The following definition is a modified version of [22, Definition 2.1]. 

Definition 3.1 (^/'-robustness) . Let T be a statistical functional and be a subset of A^f . Then 
T is called 'i/^-robust at in A4 if for each 5 > and every uniformly ^/^-integrating set A/" C 
with /i G A/" there are 5 > and rio G N such that 

V eN, d^{p,v)<5 d^,o^{¥^of~\¥,of~^)<e forn > no. 

The following theorem provides a version of Hampel's theorem that is stronger than [22, 
Corollary 3.6], the corresponding result in [22]. 

Tiieorem 3.2 (Hampel's theorem for the -i/i-weak topology). Let T : Al — t- M 6e a statistical 
functional where M. C Alf . When T : Al — > M ip-weakly continuous at fi G Al, then T is 
ip-rohust at fi in Ai. 
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Proof. The result will follow from [22, Theorem 2.4] when we can show that every uniformly ip- 
integrating set Af has the following uniform Glivenko-Cantelli (UGC) property: for each e > 
there are 5 > and no € N such that 

(3.1) sup ¥y\^d^{u,mn) > 5] < £ for n > tiq. 

According to [24, Lemma 4], the set M.^ has the UGC property for the Prohorov metric. Therefore, 
the UGC property (3.1) follows from (2.12) and Chung's uniform weak law of large numbers ([10]; 
see also [28, Proposition A. 5.1]) applied to the sequence (?/'(X„)) of random variables; notice that 
/Vcim„ = iE:ii^A(X,)and ^ dv = ^,\^{X^)\. □ 

The following result may be viewed as a converse of Hampel's theorem for the ?/;-weak topology. 
Together with Theorem 3.2 and with the choice ^ = 1 and AA = Aii it yields the classical Hampel 
theorem in the form of [20, Theorem 2.21]. Its statement uses the following notion of consistency: 
a statistical functional T is called weakly consistent at ^ ^ Ai when T„ — )■ T(/i) in P^j-probability. 

Theorem 3.3 (Converse of Hampel's theorem for the ip-weak topology). Suppose that T : — t- M 
is a statistical functional where M. C A^f . Let /i G and 6q > Q he given, and suppose that 
T is weakly consistent at each v in Ai with (i^(i/, /i) < 5q. When T is ip-robust at fi in M., then 
T : — 7- M zs ip-weakly continuous at fi. 

Proof. We must show that T{fik) T{fi) when (fik) is a sequence in Al that converges ip-weaklj 
to fi. Given such a sequence (/ifc), the set Af := {/^, yiii, //2, • • •} is clearly compact for the ip- 
weak topology. By Lemma A. 4 we conclude that Af is uniformly ■^/'-integrating. So, given e > 0, 
the ■i/'-robustness of T at /i in Al implies that there are some 6 > and no G N such that 
dpToh(J^^ik ° T~^,F^ o T~^) < e for all n > no and k > k^, where /cq G N is chosen such that 
d^il^k^ Ij) < ^ for all k > kf) (recall that d^ generates the ip-weak topology). So, assuming without 
loss of generality S < 6o, the weak consistency of (T„) at /ifc implies that 

|T(/ifc) -T(/i)| = dprohi5T{^,),6T{^,)) 

< limSUp (rfproh(5T{^.fe), P^fc O f~^) + 6 + dproh(PM O T.^^ 5t(a.))) 

= e 

for all k > ko- This completes the proof. □ 
3.2 Skorohod representation for ^-weak convergence 

The classical Skorohod-Dudley-Wichura representation theorem states that weak convergence 
fin fJ'O is equivalent to the existence of random variables Xn with law /i„ such that X„ — Xq 
almost surely. A question one may ask is whether ip-weak convergence /i„ — )■ fiQ can be expressed 
in terms of a stronger concept for the convergence X„ — )■ Xq. Here we are going to address this 
question in the context of the Orlicz spaces. 

Theorem 3.4. For any finite Young function \& the following two conditions are equivalent. 

(a) A sequence (fin) in AA.{H^) converges '^-weakly to some fiQ if and only if there exists a 
sequence (X„)„gNo in- such that Xn has law fin for each n and — Xo||iii — )■ 0. 
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(b) \I/ satisfies the A2-condition (2.3). 

For proving Theorem 3.4 we need the following lemma. 

Lemma 3.5. Let ^ he a Young function satisfying the A2-condition (2.3) and let (-^n)nGNo a 
sequence in H"^ = L*. If the sequence {'^{\Xn\))n€No uniformly integrable, then the sequence 
(\1/(2™|X„ — Xo|))„eN is also uniformly integrable for every m e Nq. 

Proof. By (2.3), we have C := sup^^^^ \E'(2x)/\E'(x) < oo for some xq > 0. We proceed by induction 
on m G Nq. 

First, let m = 0. Since is nondecreasing and convex with ^1^(0) = 0, we obtain by the triangle 
inequality 

^(l^n-Xol) < l(vl/(2|X„|) + vl/(2|Xo|)) 

= i(M/(2]l[o,.,](|Xj)|X„|) + M/(2]l[o,..„](|Xo|)|Xo|)) 

+ ^(vl/(2]l(.„,^)(|X„|)|X„|)+M/(2]l(.„,^)(|Xo|)|Xo|)) 
< vl;(2xo)+ |(vl^(|X„|) + vl;(|Xo|)). 

Since the sequence (\l/(|X„|))„gNo is uniformly integrable by assumption, we may thus conclude 
that the sequence (\Ef(|X„ — Xo|))„eN is uniformly integrable. 

Let us now suppose that (\1/(2™|X„ — X'o|))„gN is uniformly integrable for any given m G Nq. 
Following an analogous line of reasoning as in the case of m = 0, we may find 

^(2"^+i|X„-Xo|) = ^(2-2™|X„-Xo|) < *(2xo) + C*(2'"|X„-Xo|). 

Hence, (\E'(2™"'"^|X„ — Xo|))„gN is uniformly integrable, which completes the proof. □ 

Proof of (b)^(a) in Theorem 3.4- Let us suppose that the A2-condition (2.3) holds. 

We first prove that \\Xn — Xq\\^ — )■ implies fin f^o ^'-weakly. Due to Proposition 2.1.10 in 
[14], \\Xn - Xo\\q, yields E[ \1/(2|X„ - Xo|) ] and X„ Xq in probability. Convexity and 
monotonicity of \E' imply that 

< ^(|X„|) < ^*(2||X„| - |Xo||) + ^*(2|Xo|) < ^^(2|X„ - Xo|) + ^^(2|Xo|). 
Hence, dominated convergence yields that 

j ^i\x\)finidx)=E[^{\Xn\)]^E[^{\Xo\)] = j ^{\x\)fio{dx). 

Moreover, since X„ — Xq in probability the corresponding laws (/i^) converge weakly. Now the 
^-weak convergence /i„ — fiQ follows from Lemma A. 3 (iv)^(i). 

Now we prove that the ^'-weak convergence /i„ — ?■ /iq implies the existence of a sequence (X„) 
in such that ||X„ — XqH^- — )■ 0. Clearly, /i„ — yUo weakly. By Skorohod representation there 
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hence exists a sequence of random variables such that X„ — j- Xq P-a.s. The continuity of ^ 
and the fact that ^1/(0) =0 yield that 



(3.2) 
(3.3) 



*(|X„|)-^*(|Xo|) P-a.s. 

^(klXn - Xo\) — > P-a.s. for all A; > 0. 



Moreover, the \l/-weak convergence fin /^o implies that 



(3.4) 



mi\Xn\)\ 



^(|x|)/i„(rfx) 



^{\x\)fio{dx) = E[^{\Xo\)]. 



Now, (3.2), (3.4), and Vitali's theorem in the form of [21, Proposition 3.12 (ii)^(iii)] imply that 
the sequence (\l/(|X„|))„gN(, is uniformly integrable. Applying Lemma 3.5 yields the uniform 
integrability of the sequence (\l/(/c|X„ — Xo|))„gN for every k > 0. Therefore, (3.3) and another 
application of Vitali's theorem, this time in the form of [21, Proposition 3.12 (iii)^(ii)], yield 
E[\&(/c|X„ — Xo\)] — )■ for every k > 0, which implies \\Xn — Xo||* — according to [14, 
Proposition 2.1.10 (5)]. Finally, the sequence {Xn) belongs to if*, because under the A2-condition 
if* coincides with the class of random variables Y with E[\l/(|y|) ] < oo. □ 

Proof of (a.) ^{h) in Theorem 3.4- Let us suppose that condition (a) in Theorem 3.4 holds, but that 
\l/ does not satisfy the A2-condition (2.3). We will show that this leads to a contradiction. Since 
^ does not satisfy the A2-condition (2.3) and our probability space is atomless, we have if* ^ L* 
by [14, Theorem 2.1.17]. Hence there exists a random variable Y > such that E[\&(y)] < oo 
and E[ '${2Y) ] = oo. We then choose a„ > such that 



2E 



^ (2 (F A a„) ) > n + ^ (4n) for each n, 



and let 



Xri 



(Y - n)+ A an. 



Then Xn € L°°, and hence Xn G if* since is finite. Moreover, Xn — )■ -=0, and 

(3.5) <E[*(|X„|)] =E[$((F-n)+ Aa„)] <E[^{Y -n)] ^0 

by dominated convergence. It therefore follows from Lemma A. 3 that yU„ := P o X~^ — t- Sq in the 
\l/-weak topology. 

We will show next that we cannot have \\Xn — Xo\\^ = — )■ 0. Since any sequence (Xn) 
for which P o X~^ = fin must satisfy = condition (a) in Theorem 3.4 will thus 

be violated. So let us suppose by way of contradiction that — 0. By [14, Proposition 

2.1.10 (5)], this is equivalent to E[ \1/(A;|X„|) ] — )■ for every A; > 0. By taking k = A and using the 
fact that the convex function i{x) := \l/(4(0 V x)) satisfies £{x — y) > 2i{x/2) — i{y) we obtain 

E[^(4|X„|)] = E[^(4((F-n)+ Aa„))] >E[^(4(FA(a„ + n)-n) 

> 2E 

> 2E 



^ (2{Y A {an + n))j - ^(4n 
^(2(yAa„))] -*(4n) > n, 



by construction. This is the desired contradiction. 



□ 
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4 Proofs of the results from Section 2 



Proof of Theorem 2.5. Since E[\I/(A;|X|) ] < oo for each A; > 0, Birkhoff's ergodic theorem (e.g., in 
the form of [5, Theorem 6.28]) imphes that for each /c > and P-a.e. oj G f2 

(4.1) / ^(A;|x|)m„(a;)(rfx) = -^^(A;|X,(u;)|)^E[^(A;|X|)]= j ^ {k\x\) ^i{dx) , 
-> ^ i=i J 

where /i := P o X~^. Moreover, for P-a.e. w G 

(4.2) mn{<^) — y fJ' weakly, 

due to Birkhoff's ergodic theorem and an application of [25, Theorem 6.6]. Hence there exists a 
measurable set Qq E such that P[f2o] = 1 and such that for each a; G fio (4.2) is satisfied and 
(4.1) holds for each k E N. Let us fix uq G Qq. Since our probability space is atomless, standard 
Skorohod representation yields the existence of random variables X'^°, {X!^°) such that X'^° has 
law fi, X^'' has law m„(wo), and X^" X'^o P-a.s. By (4.1), for each A; G N, 

(4.3) E[*(A;|X;;^°|)] = J ^{k\x\) fhnMidx) ^ *(A;|x|) /i(c/x) = E[^(A;|X^°|) ]. 

Therefore the sequence {'${k\X!^°\)) is uniformly integrable for each /c G N. 

Now take a > and pick k E N such that k > 2a. Since \E' is convex and nondecreasing, we 
have 

< ^(a|X;^« -X"<'|) < ^(^^(2a|X7|) + ^(2a|X'^»|)) < ^(^^(A;|X7|) + ^(A;|X'"«|) 

It follows that the sequence {'^{a\X!^° — X'^"|)) is uniformly integrable. Since clearly \&(a|X^° — 
X'^o|) _^ P-a.s., we get that E[^(a|X;^« - X"^"])] for each a > and in turn that ||X;;^o - 
X'^^'ll^ — )■ (see [14, Proposition 2.1.10]). By [6, Theorem 4.1] p is continuous with respect to the 
Luxemburg norm || ■ ||^, and so 

Pn(c^o) = 7^p(m„(a;o)) = p(Xr) p{X^') = p{X) 
for each Uq eVIq. □ 

Proof of Theorem 2.1. We first prove the implication (b)=^(a) in Theorem 2.7. So let us assume 
that satisfies the A2-condition (2.3) and let p be a convex risk measure on with associated 
map Hp. To show the continuity of T^p, choose a sequence (yU.„) such that /i„ — /i \E^-weakly. By 
Theorem 3.4 there exists a sequence (X„) in such that each X„ has law and such that 
||X„ — XqII* — )■ 0. But it was shown in [6, Theorem 4.1] that p is continuous with respect to the 
Luxemburg norm || ■ ||^ (see, also [26, Proposition 3.1]). Therefore, 

Tlp{Pn) = p{Xn) > p(Xo) = 7^p(/io), 

which proves the implication (b)^(a). 

We now prove the implication (a)^(b) in Theorem 2.7. This proof is similar to the proof of 
(a)^(b) in Theorem 3.4. We assume that \1/ does not satisfy the A2-condition (2.3), and we will 
construct a risk measure p for which TZp is not \&-weakly continuous. This risk measure is given 
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as the utility-based shortfall risk measure (2.6) with convex loss function i{x) := \E'(0 V (8a;)). It 
follows as in (2.7) that E[£(— X — m) ] is finite and well-defined for m G M and X G i/*. 

Since \1/ does not satisfy the A2-condition (2.3) and our probability space is atomless, we 
have if* ^ L* by [14, Theorem 2.1.17]. Hence there exists a random variable Y > such that 
E[^(F) ] < oo and E[^'(2'K) ] = oo. We then choose a„ > such that 



(4.4) 



4E 



^ (2 (F A a„) ) >n + i{n/2) for each n, 



and let 

X„:=-((F-n)+Aa„). 

Then Xn G L°° and hence X„ G if* since is finite. As in (3.5) we get E[^'(|X„|)] 0, and 
so /i„ := P o X~^ — )■ So in the \l/-weak topology. We now prove that the sequence Zn '■= pi^n) = 
lZp{jjin) is unbounded, which will imply that IZp is not continuous for the \l/-weak topology. 

To prove that the sequence {z^) is unbounded, we assume by way of contradiction that z* := 
sup„2;„ < oo. We see from (2.7) and dominated convergence that each Zn = pi^n) solves the 



equation E[^(— X„ 



1. The convexity of i implies that i{x — y) > 2£(x/2) — i{y). Hence, 



E[e{-Xn- Zn)]>E[i{-Xn- z*)] > 2E £[ {{Y -ny Aan) 



> 2E 

> 4E 
= 4E 



£(l(rA(a„ + n))-^ 



i{z* 



£(-(FAa„))] - iin/2) - i{z*) 
^(2(yAa,)) 1 -i{n/2)-i{z*). 



But according to (4.4), the expression on the right is bounded from below hj n — i{2z*), which 
yields the desired contradiction. □ 

Proof of Theorem 2.9. The equivalence between conditions (a) and (b) follows from Theorem 2.7. 
The implication (b)^(c) simply follows from the fact that p is equal to the restriction of p to 

To prove (c)=^(d), we first note that := P o X„ G A^(L°°) when {Xn) is a sequence as in 
(d). Moreover, — implies that /i„ — > 5q \l/-weakly. So it is now clear that (c) implies (d). 

We now prove (d)^(a). To this end, we will apply [6, Theorem 4.3], which states that p 
is finite on when belongs to the topological interior of the effective domain of the map 
p : — 7- M U {+oo}. We will therefore show that p is finite on the centered e-ball := {X G 
: < e} when e > is small enough. 

Suppose that (X„) is a sequence in L°° such that — 0. Then /i„ := P o X~^ — )■ 5q 

^-weakly, and so p(X„) = 7^p(p„) — )■ 7^p(5o) = p(0). Thus, p : L°° — M is continuous with respect 
to II • \\^ at 0. Hence, for ii' > given, there exists e > such that p{X) < K ior X E Ci L°°. 
Now let us fix X G -B^. The negative part X~ belongs again to B^, and monotone convergence 
yields X~ A /c — )■ X~ in L^. Using the lower semicontinuity of p : — )■ M U {+00} hence gives, 

p(^) < Pi-X~) < liminf p(-X- A k) = liminf p(-X- A A;) < K. 

Here we have also used the monotonicity of p in the first and the fact that —{X~ A k) E B^n 
in the final step. □ 
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Proof of Proposition 2.13. We prove the assertion by way of contradiction. So let ip : IR+ — )■ (0, oo) 
be a nondecreasing function such that hminfa,foo ip{x)/x = and suppose that Tip is ip{\- |)-robust 
on Ai{L°°). By Theorem 2.5, TZp is consistent at each /i G Ai{L°°). Hence Theorem 3.3 and the 
robustness of TZp imply the continuity of TZp on Ai{L°°) with respect to ipd ■ |)-weak convergence. 
As in the proofs of Theorems 3.4 and 2.7 we will construct a sequence C A^(L°°) that 

converges ■ |)-weakly to 6o but for which 7lp{fin) 7^ Tlpi^o)- To this end, we easily construct 
a random variable y > such that E[?/'(y)] < 00 and E[y] = 00 and pick a„ > such that 
E[F A a„] > 2n. Then X„ := (Y - n)+ A a„ ^ P-a.s. and E[ij{xJ)] ^ by dominated 
convergence. Hence, /i„ := P o converge ■ |)-weakly to 60 by Lemma A. 3. However, 
E[Xn] >^Y A an] - n > n. Now [27, Lemma 2.3] yields that 

Upifin) = p{Xn) > p(E[X„]) > p(n), 

which shows that we cannot have 7lp{fin) — ?■ 7lp{So). □ 

Proof of Theorem 2.I4. (a)^(b): By Theorem 2.5, TZp is consistent at each /i G A^(if*). Hence 
Theorem 3.3 and the robustness of TZp imply the continuity of TZp on A^(iJ*) with respect to 
\E'-weak convergence. Thus, due to Theorem 2.7, must satisfy the A2-condition (2.3). 

(b)^(a): By Theorem 2.7, TZp is a continuous map on A^(if*) = A^f . Its \&-robustness on 
Aif therefore follows from Theorem 3.2. □ 

Proof of Theorem 2.15. The implication (a)=^(b) is obvious. 

(b) ^(c): First, we note again that TZp is consistent on M.{L°°) by Theorem 2.5. Therefore, 
Theorem 3.3 and the robustness of TZp imply the ^f-weak continuity of TZp on Ai{L°°). Theorem 
2.9 now yields (c). 

(c) ^(a): Condition (c) implies that p is a convex risk measure on . Hence, (a) follows from 
an application of Theorem 2.14. □ 

Proof of Theorem 2.19. Obviously, it suffices to show that pi is at least robust as p2- For this 
purpose let ^ denote any finite Young function satisfying the A2-condition (2.3) such that p2 is 
finite on . 

In view of Lemma A.l and Lemma A. 2 (cf. Appendix A.l) we find some real number > 1 
such that W C is valid for p > p^,, and if* C L'^ holds for every q G [l,p^). Since p2 is finite 
on if*, we may conclude 

iqr(pi) > iqr(p2) > — • 
p^ 

In particular there exists some 1 < q < p^i such that p^ is finite on L^. This implies that is finite 
on if* too because if* C by choice of p^. Hence pi is at least robust as p2 due to Corollary 
2.17, and the proof is complete. □ 

Proof of Proposition 2.21. First, when g is continuous we have 5^(0+) = and in the 'spectral' 
representation (2.16) the part containing the essential supremum vanishes. Moreover, it follows 
from [16] that (2.16) remains true for p^ and X G L^. Next, the function f{t) := g'j^{l — t) is 
nondecreasing, and we have 



V@Rt{X)g'^{t)dt= [ q_x{t)f{t)dt, 
^0 
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where q-x is a quantile function for —X. 

Now suppose that / G /^'^(0, 1). By Holder's inequahty, we get that for any X G L^, 

p,(X) = j\-x{t)f{t)dt < i^j'^ \q-x{t)rdty'[j'^ fitydtf' = E[|Xr]Vf 11/11^, < oo. 

Here we have used [18, Lemma A. 23] in the third step. 

Now we assume that is finite on L^. Since our probabihty space is atomless, it supports 
a random variable U with uniform distribution on (0, 1). We will show that Y := f{U) belongs 
to L'^. According to the Banach-Steinhaus theorem this holds if and only if E[(— < oo 
for all X G U' . But for X G LF , the fact that / is a quantile function for Y and the upper 
Hardy-Littlewood inequality (e.g., [18, Theorem A. 24]) imply that 

oo>p(X)= / q,x{t)f{t)dt= f q.x{t)qY{t)dt>n{-X)Y]. 
Jo Jo 

This concludes the proof. □ 



A Appendix 

A.l Relation between Orlicz space and i7-space 

Let be a Young function satisfying the A2-condition (2.3). In this section, we discuss some 
relations between the Orlicz space L* = i7* and the standard L^-spaces. 
By the A2-condition (2.3), 

/ , ^(2y) 
aix) := sup — -— , x > 0, 

y>^ *(y) 

is a finite and nonincreasing function. Moreover, a > 2, because the convexity of implies that 
^(2x) < 2^(x). Hence we can define a(oo) := lim^^ioo a(a^) £ [2, oo). 



Lemma A.l. // 

for some Xq G a~"^(M), then U' C . 
Proof. Let Xq G a~^(M) such that 



2^ ^ ^(xq) 



a[,xo) x'q 



2^ ^ ^(xo) 



ttl^xoj xl 



It is easy to check by induction that 

(A.l) (2^x0)" > *(2^Xo) forA;GN. 

Now, for any x > 2x0, there exists a unique /c G N with k >2 and 2^^^xo <x< 2^xo- Hence, by 
(A.l) and the monotonicity of 

2V > (2'^xo)'' > ^(2^x0) > ^(x). 
Therefore, 2^x^ > \l'(x) for x > 2xo, and so [14, Theorem 2.2.3] implies C = iJ*. □ 
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In view of Lemma A.l we have C for at least one real number p > 1. Furthermore, we 
may define the number 

■= inf{p > 1 I LP C iJ*}. 
The following Lemma considers the relationship between and for 1 < p < p^,. 

Lemma A. 2. The relationship C holds for any p G 

Proof. Fix any p G [l^p^^). By definition of p^, we have V (f_ so that in view of Lemma A.l 
we may conclude 

a(xo) 

for every xq G a~^(M). Hence, for any strictly increasing, unbounded sequence in 



we obtain 



This means 



< — - — r < lim inf — - — - < lim inf 



(3{xo) := inf ^ > 

x>xo xP 



for some G a ^(l^), implying \E'(a;) > (3{xq)xP for a; > Xq. Then if* C follows from [14, 
Theorem 2.2.3]. □ 

A. 2 Auxiliary results on the ?/;-weak topology 

First recall from [18, Corollary A. 45] that the -j/^-weak topology on A^f (M) is separable and metriz- 
able. The following lemma provides some useful characterizations of the ip-weak convergence; see 
[22, Lemma 3.4] for a proof. 

Lemma A. 3. The following statements are equivalent: 
(i) /i„ — )■ /i ip -weakly. 

(a) J f djin J fdfi for every f G C^(R). 

(Hi) f f dfin f f dfi for every continuous f with compact support and for f = ip. 
(iv) fin ^ ^ weakly and f ip dfin J 'ipdfi. 

The following lemma gives a transparent characterization of the ip-weak\j compact subsets of 
A4f. Recall that a set A/" C Aif is called uniformly ^/^-integrating if it satisfies (2.13). 

Lemma A. 4. A set M C A^f is relatively compact for the ip-weak topology if and only if there 
exists a measurable function : R — )■ [0, oo) such that (j){x)/ip{x) oo as |x| —?■ oo and such that 



(A. 2) sup / (f) dfi < oo. 

In this case, Af is uniformly ip -integrating. 
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Proof. The first statement is an immediate consequence of Corollary A. 47 in [18]. To prove 
the latter statement, we assume without loss of generality that > and that tlj{x) — t- oo as 
|x| — )■ oo. Fix e > 0, and denote by K the left-hand side of (A. 2). Choosing Mi > so large so 
that il){x) / (f){x) < e/K when \x\ > Mi, and choosing Mq > so large so that tplx) > Mq implies 
|x| > Ml, we obtain 

sup / ^l){x)l^f^^)>M ^i{dx) = sup / — — a^(^)>M /i(c?a;) 



< — sup / 4>{x) fi{dx) 



for all M > Mq. That is, (2.13) holds. □ 
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